Event-Based Clustering for Reducing Labeling Costs of Event-related Microposts
Abstract
Automatically identifying the event type of event-related information in the sheer volume of social media data requires machine learning. However, this approach depends heavily on (1) the number of correctly labeled instances and (2) labeling costs. Active learning has been proposed to reduce the number of instances to label. Although the thematic dimension is already used, other metadata such as spatial and temporal information, which would help achieve a more fine-grained clustering, is currently not taken into account. In this paper, we present a novel event-based clustering strategy that makes use of temporal, spatial, and thematic metadata to determine instances to label. An evaluation on incident-related tweets shows that our selection strategy for active learning outperforms current state-of-the-art approaches even with few labeled instances.
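The abstract does not spell out the clustering procedure, so the following is only a minimal sketch of the general idea: group microposts that agree on theme, time, and place, then pick one representative per group as a labeling candidate. The `Micropost` fields, the greedy leader-style grouping, and the thresholds `max_km` and `max_seconds` are all assumptions made for illustration, not the authors' method.

```python
from dataclasses import dataclass
from math import asin, cos, radians, sin, sqrt


@dataclass
class Micropost:
    text: str
    timestamp: float  # seconds since some reference point (assumed field)
    lat: float        # latitude in degrees (assumed field)
    lon: float        # longitude in degrees (assumed field)
    topic: str        # thematic tag, e.g. from a preliminary classifier (assumed)


def haversine_km(a: Micropost, b: Micropost) -> float:
    """Great-circle distance between two posts in kilometres."""
    lat1, lon1, lat2, lon2 = map(radians, (a.lat, a.lon, b.lat, b.lon))
    h = sin((lat2 - lat1) / 2) ** 2 + cos(lat1) * cos(lat2) * sin((lon2 - lon1) / 2) ** 2
    return 2 * 6371.0 * asin(sqrt(h))


def same_event(a, b, max_km=1.0, max_seconds=1800.0):
    """Hypothetical rule: same theme, close in time, close in space."""
    return (
        a.topic == b.topic
        and abs(a.timestamp - b.timestamp) <= max_seconds
        and haversine_km(a, b) <= max_km
    )


def cluster_events(posts, max_km=1.0, max_seconds=1800.0):
    """Greedy grouping: a post joins the first cluster whose first post matches it."""
    clusters = []
    for post in posts:
        for cluster in clusters:
            if same_event(post, cluster[0], max_km, max_seconds):
                cluster.append(post)
                break
        else:
            clusters.append([post])
    return clusters


def select_for_labeling(posts, budget):
    """Pick the first post of each of the `budget` largest clusters."""
    clusters = sorted(cluster_events(posts), key=len, reverse=True)
    return [cluster[0] for cluster in clusters[:budget]]


posts = [
    Micropost("fire downtown", 0.0, 47.6062, -122.3321, "fire"),
    Micropost("smoke near downtown", 600.0, 47.6065, -122.3325, "fire"),
    Micropost("crash on highway", 300.0, 47.70, -122.30, "crash"),
    Micropost("another fire", 90000.0, 47.6062, -122.3321, "fire"),
]
for post in select_for_labeling(posts, budget=2):
    print(post.text)
```

In an active learning loop, the selected posts would be sent to an annotator and the classifier retrained; how labels are propagated within a cluster is left open here because the abstract does not describe it.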
Similar resources
Event-based Clustering for Reducing Labeling Costs of Incident-Related Microposts
Automatically identifying the event type of event-related information in the sheer volume of social media data requires machine learning. However, this approach depends heavily on (1) the number of correctly labeled instances and (2) labeling costs. Active learning has been proposed to reduce the number of instances to label. However, current approaches focus on the thematic dimension, i.e., ...
th Workshop on Making Sense of Microposts (#Microposts2015) Big things
Detecting events using social media such as Twitter has many useful applications in real-life situations. Many algorithms which all use different information sources—either textual, temporal, geographic or community features—have been developed to achieve this task. Semantic information is often added at the end of the event detection to classify events into semantic topics. But semantic inform...
TESTING FOR “RANDOMNESS” IN SPATIAL POINT PATTERNS, USING TEST STATISTICS BASED ON ONE-DIMENSIONAL INTER-EVENT DISTANCES
To test for “randomness” in spatial point patterns, we propose two test statistics that are obtained by “reducing” two-dimensional point patterns to one-dimensional ones. The exact and asymptotic distributions of these statistics are also derived.
Clustering of Gabor Atoms Describing Event-Related Potentials - Solution for ERP Detection Algorithm based on Matching Pursuit when ERP Waveform is Approximated by Two or More Gabor Atoms
In our research group, we also focus on methods for automatic detection of event-related potentials in the EEG signal. We published the algorithm for event-related potential detection based on the matching pursuit algorithm in one of our previous papers. As usual, this method does not work well under special circumstances which can occur (it is a situation when the waveform of event-related pot...
Evaluation of self-esteem in children with attention-deficit/hyperactivity disorder based on event-related potential
Background: Self-esteem, the value we place on ourselves, has been associated with effects on health and life satisfaction. Many studies have reported that children with attention-deficit/hyperactivity disorder (ADHD) suffer from low self-esteem, which has been associated with negative life outcomes. The present study investigated neural correlates of self-esteem in this group compared with typically dev...
Publication date: 2015